Pass-Fail Testing: Statistical Requirements and Interpretations

نویسندگان

David Gilliam

Stefan Leigh

Andrew Rukhin

William Strawderman

چکیده

Performance standards for detector systems often include requirements for probability of detection and probability of false alarm at a specified level of statistical confidence. This paper reviews the accepted definitions of confidence level and of critical value. It describes the testing requirements for establishing either of these probabilities at a desired confidence level. These requirements are computable in terms of functions that are readily available in statistical software packages and general spreadsheet applications. The statistical interpretations of the critical values are discussed. A table is included for illustration, and a plot is presented showing the minimum required numbers of pass-fail tests. The results given here are applicable to one-sided testing of any system with performance characteristics conforming to a binomial distribution.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Bioequivalence Approach for Whole Effluent Toxicity Testing

Increased use of whole effluent toxicity (WET) tests in the regulatory arena has brought increased concern over the statistical analysis of WET test data and the determination of toxicity. One concern is the issue of statistical power. A number of WET tests may pass the current hypothesis test approach because they lack statistical power to detect relevant toxic effects because of large within-...

متن کامل

Evaluation of Statistical Outlier Rejection Methods for IDDQ Testing

The quiescent current testing (IDDQ testing) for CMOS ICs provides several advantages over other testing methods. However, the future of IDDQ testing is threatened by increased sub-threshold leakage current for new technologies. The conventional pass/fail limit setting methodology cannot survive in its present form. In this paper we evaluate two statistical outlier rejection methods – the Chauv...

متن کامل

Measuring Hospital Performance Using Mortality Rates: An Alternative to the RAMR

Background The risk-adjusted mortality rate (RAMR) is used widely by healthcare agencies to evaluate hospital performance. The RAMR is insensitive to case volume and requires a confidence interval for proper interpretation, which results in a hypothesis testing framework. Unfamiliarity with hypothesis testing can lead to erroneous interpretations by the public and other stakeholders. We argue t...

متن کامل

The reliability of the pass/fail decision for assessments comprised of multiple components

OBJECTIVE The decision having the most serious consequences for a student taking an assessment is the one to pass or fail that student. For this reason, the reliability of the pass/fail decision must be determined for high quality assessments, just as the measurement reliability of the point values. Assessments in a particular subject (graded course credit) are often composed of multiple compon...

متن کامل

Translation Evaluation in Educational Settings for Training Purposes

The following article describes different methods and techniques used in educational settings for translation evaluation. Translation evaluation is the placing of value on a translation i.e. awarding a mark, even if only a binary pass/fail one. In the present study, different features of the texts chosen for evaluation were firstly considered and then scoring the t...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره 114 شماره

صفحات -

تاریخ انتشار 2009

Pass-Fail Testing: Statistical Requirements and Interpretations

نویسندگان

چکیده

منابع مشابه

Bioequivalence Approach for Whole Effluent Toxicity Testing

Evaluation of Statistical Outlier Rejection Methods for IDDQ Testing

Measuring Hospital Performance Using Mortality Rates: An Alternative to the RAMR

The reliability of the pass/fail decision for assessments comprised of multiple components

Translation Evaluation in Educational Settings for Training Purposes

عنوان ژورنال:

اشتراک گذاری